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SYSTEM FOR REAL TIME TRANSMISSION OF VARIABLE BIT 
RATE MPEG VIDEO TRAFFIC WITH CONSISTENT QUALITY 




This application claims priority from Korean Patent Application 
5 No. 2000-067183, filed on November 13, 2000, which is fully 
incorporated herein by reference. 

BACKGROUND 

1. Technical Field ; 

The present invention relates generally to a video traffic system 

□ 

io and, more particularly, to a system for real time transmission of 

CO 

variable bit rate MPEG video traffic with consistent quality. 

sake 

in 

s 2. Description of Related Art : 

m It is known that variable bit rate (VBR) video provides more 

53 15 constant quality than constant bit rate (CBR) video (see, e.g., "A 
Variable Rate Video Decoder For Synchronous Transfer Mode 
Networks" by W. Verbiest and L. Pinoo, IEEE J. Select. Areas 
Commun., Vol. 7, No. 5, pp. 761-770, June. 1989; and "Packet Video, 
Modeling and Signal Processing", by N. Ohta, Artech House, 1994). 
20 The VBR MPEG (Moving Picture Experts Group) coded video traffic is 
burst by three different encoding frame types, i.e., I-frame, P-frame, 
and B-frame as proposed in "Generic Coding of Moving Pictures and 
Associated Audio: Video", ISO-IEC/JTC1/SC29/WG1 1/ NO801rev, 
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MPEG Committee Draft: Rec. H. 262, April, 1995. The VBR video is 
highly time-variant due to successive scenes having different contents 
as proposed in "Bandwidth Renegotiation For VBR Video Over ATM 
Networks" by D. J. Reininger, D. Raychaudhuri and J. Y. Hui, IEEE J. 
Select. Areas Commun., Vol. 4, No. 6, pp. 1076-1086, August, 1996. 

A video with constant quality can be obtained by transmitting an 
unconstrained VBR providing similar quantization parameter to all 
frames, such as proposed in the above reference by D. J. Reininger, et 
al., as well as "MPEG Coding For Variable Bit Rate Video 
Transmission", by P. Pancha and M. El Zarki, IEEE Commun. Mag., Vol. 
32, No. 4, pp. 54-66, May. 1994. Since an ATM (Asynchronous 
Transfer Mode) network provides limited bandwidth, however, the ATM 
network requires a constrained VBR specified by three parameters of 
peak rate, sustainable rate, and maximum burst length. Smoothing (or 
shaping) is one of the most dependable methods that efficiently 
improves network utilization and achieves constrained VBR 
transmission at an acceptable delay cost, as proposed in "Performance 
Models of Statistical Multiplexing In Packet Video Communications", 
by B. Maglaris, IEEE Trans. Commun., Vol. 36, No. 1, pp. 834-844, 
July. 1998, as well as "Analysis of ON-OFF Source Shaping For ATM 
Multiplexing", by F. Bernabei, Proc. INFOCOM, pp. 1330-1336, 1993. 

In addition to smoothing, bandwidth renegotiation is another 
important system for the transmission. However, undesirable operation 
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of the smoothing and bandwidth renegotiation may give rise to 
unexpected encoder buffer overflow, decoder buffer overflow, or 
violation of the renegotiated bandwidth, which causes unwanted loss of 
data. The undesirable loss of data seriously deteriorates the 
transmitted video quality. Therefore, a method for controlling a joint 
encoder and channel rate is needed which considers an allocated 
bandwidth and sizes of the encoder and decoder buffers, such as 
proposed by "Constraints on Variable Bit Rate Video for ATM 
Networks", A. R. Reibman and B. G. Haskell, IEEE Trans. Circuit Syst. 
Video Technol., Vol. 2, No. 4, pp. 361-372, December. 1992; and "Joint 
Encoder and Channel Rate control ofVBR Video over ATM Networks", 
W. Ding, IEEE Trans. Circuit Syst. Video Technol., Vol. 7, No. 2, pp. 
266-278, April, 1997. 

Reininger et al. proposes method for bandwidth renegotiation 
without traffic smoothing, and an method for controlling encoder rate 
corresponding to negotiated traffic parameters. Reibman, et al. 
discloses that constraints imposed on the encoded video bit rate (i.e., 
encoded video transmission rate) are caused by encoder and decoder 
buffering. Ding proposes a method for traffic smoothing, joint encoder, 
and channel rate control, wherein the negotiated peak rate is very large 
while the employed peak rate is very small. That is, the allocated peak 
rate is 4 times higher than the rate to have been maintained (about 180 
kbits/frame) if the used peak rate is only half of the allocated peak rate. 
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In the methods disclosed by Reibman, et al. and Ding, traffic 
parameters for bandwidth renegotiation and rate control cannot be 
obtained by on-line processing. Some techniques for traffic smoothing 
for VBR MPEG video are disclosed in the following references: 

5 "Source Traffic Smoothing and ATM Network Interfaces for VBR MPG 
Video Encoders", by K. Joseph and D. Reininger, Proc. ICIP, pp. 1761- 
1767, 1995); "Adaptive Prediction and Smoothing of MPEG Video in 
• ATM networks", by S. Jung and J. S. Meditch, Proc, ICC, pp. 832-836, 
1995); and "An Algorithm for Lossless Smoothing of MPEG Video" by S. 

io S. Lam, S. Chow and D. K. Y. Yau, ACM SIGCOMM, pp. 281-293, 
1994. These techniques, however, have not considered bandwidth 
renegotiation. 



15 system and method for real-time transmission of VBR MPEG video 
traffic with consistent quality. 

According to one aspect of the invention, a variable bit rate 
video system comprises a video encoder for encoding video data, an 
encoder buffer for storing data encoded from the video encoder, an 

20 encoder rate controller for estimating the number of bits quantized from 
a target quantization parameter during a frame of the video data 
corresponding to discrete cosine transform (DCT) coefficients of the 
frame, and for providing quantization parameters based on slice level to 



SUMMARY OF THE INVENTION 



The object of the present invention is to provide an efficient 
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the encoder, a channel rate controller for generating parameters for 
smoothing and bandwidth renegotiation corresponding to the number of 
bits generated from the encoder rate controller, a network for generating 
negotiated parameters corresponding to the parameters generated from 
the channel rate controller, and a Leaky Bucket counter for transmitting 
the video data stored the encoder buffer through the network. 

In a system for transmission of variable bit rate MPEG video 
traffic with consistent quality, each frame is encoded with quantization 
parameter generated from an encoder rate controller, and an encoded 
data is transmitted to the encoder buffer. Transmission rate about each 
frame interval is determined by the channel rate controller at the 
beginning of the frame interval. Data transmission buffered at the 
encoder buffer is first regulated by the Leaky Bucket counter, and the 
regulated data is transferred to a decoder buffer in a video receiving 
system through network. 

In another aspect of the invention, a method for transmission of 
variable bit rate MPEG video traffic with consistent quality, comprises 
the steps of: encoding frames of video data and generating encoding 
information associated therewith; estimating a number of bits quantized 
from a target quantization parameter during a given frame of said video 
data based on encoding information of the given frame; estimating a 
maximum number of bits capable of being generated based on network 
traffic parameters and buffer sizes; computing a target bit rate using the 
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estimated number of bits quantized with the target quantization 
parameters and the estimated maximum number of bits; generating 
quantization parameters to control the actual encoding rate, if necessary, 
so that a number of actual bits generated does not exceed the target bit 
5 rate; and dynamically negotiating with a network to generate traffic 
parameters for smoothed traffic and required bandwidth. 

These and other aspects, features, and advantages of the present 
invention will become apparent from the following detailed description 
of the preferred embodiments, which is to be read in connection with 
10 the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWING 
The Figure is a block/flow diagram illustrating a system/method 
for transmitting and receiving VBR video according an embodiment of 
11] the present invention. 

JS 15 DESCRIPTION OF PREFERRED EMBODIMENTS 

In one embodiment of the invention, a system and method are 
provided that employs both a joint encoder and channel rate control 
protocol that complies with negotiated traffic parameters and 
constraints imposed by encoder and decoder buffers. To maintain a 
20 consistent visual quality, a target quantization parameter (target Q) that 
is applied to each frame is selected based on factors such as encoder 
and decoder buffer sizes and delay. Low target Q becomes more 
frequently applied by encoder rate control, and resulting in significant 
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fluctuations of visual quality of transmitted video. 

For a bit rate control method, the present invention affords an 
efficient method for selecting target Q and dynamic bandwidth 
renegotiation for smoothed traffic. In a preferred bandwidth 
renegotiation method, the traffic parameters for the smoothed traffic 
and required bandwidth are estimated in real-time, and dynamically 
renegotiated with networks. A preferred traffic smoothing method 
comprises analyzing the conditions of the traffic parameters for 
transmission of VBR MPRG video without delay, in which peak rate 
can be smoothed to the sustainable rate. A coding result of 
unconstrained VBR or CBR with same average bandwidth are compared 
with the proposed algorithm using simulations. Simulation results of 
the present invention have shown that a consistent quality video can be 
transmitted in accordance with the preferred algorithm. 

It is assumed that the consistent visual quality of transmitted 
video can be achieved whenfSimilar target quantization parameters are 
applied to the same type of frames within a scene. A preferred system 
and method for transmitting/receiving VBR MPEG video will now be 
described in further detail. 

The Figure is a block diagram illustrating a system for 
transmitting and receiving VBR video data in accordance with an 
embodiment of the present invention. It is to be understood that the 
Figure further depicts a method for transmitting and receiving VBR 
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video data according to one aspect of the present invention. The 
system comprises a VBR video transmission system 100 and a VBR 
video receiving system 300. The transmission system 100 comprises a 
video encoder 110, an encoder buffer 120, a Leaky Bucket (LB) counter 
150, an encoder rate controller (ERC) 130, and a channel rate controller 
(CRC) 140. The VBR video receiving system 300 comprises a decoder 
buffer 310 and a video decoder 320. The ERC 130 and CRC 140 
control the rate generated from video encoder 110 and the rate 
transmitted from a network 200, respectively, satisfying the imposed 
constraints. 



provided by the encoder rate controller 130. Coded data from the 
video encoder 110 is transferred to the encoder buffer 120. 
Transmission rate for each frame interval is determined by the channel 
rate controller 140 at the beginning of the frame interval. The 
transmission of buffered data is first regulated by Leaky Bucket (LB) 
counter 150, and regulated data is transferred to the decoder buffer 310 
via network 200. It is assumed that a delay within the network is 
constant and, therefore, the delay within the network can be ignored for 
purposes of designing rate control protocol according to the invention. 

In accordance with a preferred embodiment of the invention, the 
system selects an appropriate set of target quantization parameters for 
each frame type. The joint encoder and channel rate control comply 



Each frame is coded with a quantization parameter 
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with the constraints imposed by the encoder and decoder buffer sizes 
and the negotiated traffic parameters. 

The following notations are used herein for the description. Q tgt 
= Qi, Qp, Qb comprises a set of target quantization parameters, where Q h 
5 Q P , and Q B are the parameters associated with the I-frame, P-frame, and 
B-frame, respectively. T denotes time duration corresponding to an 
uncoded frame. The terms Ei and Rj respectively represent the number 
of bits generated and transmitted during the interval [(i-l)T, iT]. E ; + 
and Ri + denote the maximum number of bits capable of being generated 
10 and transmitted during i-th frame period, respectively. E f denotes 
the target number of bits to be generated during i-th frame period. 
Ei(Qtgt) and %(Q gl ) denote the actual and estimated number of bits as 
being quantized with target quantization parameters Q tgt during i-th 
frame, respectively. D represents delay within frames, that is, after the 
15 decoder begins to receive data, it waits DT seconds before starting to 
decode. Bi e and Bi d denote the buffer states in accordance with bits of 
encoder and decoder at the end of the i-th frame period t=iT, 
respectively. B c max and denote the buffer sizes in bits of encoder 
and decoder, respectively. L max denotes the size of the Leaky Bucket 
20 (LB) counter in bits. Lj is the state of LB counter in bits at the end of 
the i-th frame period t=iT. 

The operations of the video transmission system 100 are as 
follows. It is assumed that the i-th frame is available for being coded. 
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The encoder rate controller (ERC) 130 estimates %(Q & ) when the frame 
is encoded with target quantization parameters Q tgt by using information 
obtained from the current frame such as discrete cosine transform 
(DCT) parameter as well as previously coded frame. The ERC 130 
calculates the target bit rate E tgt by using EfQ gt ) and Ej + provided from 
the channel rate controller (CRC) 140. Then, the ERC 130 controls the 
actual encoder rate (which preferably does not exceed the target rate) 
by appropriately assigning quantization parameter on the basis of slice- 
level, i.e. q ik ,(k = 0X...,N sllce -\). 

The CRC 140 estimates traffic parameters for smoothing and 
bandwidth renegotiation by using Ej(Q tgl ), U=i-G N+ i, I] that is 
provided from the ERC 130. If necessary, the CRC 140 renegotiates 
with networks for the newly required bandwidth. And the CRC 140 
determines the actual rate transmitted from encoder buffer 120 by using 
the calculated upper bounds Ei + and Rj + satisfying the constraints 
imposed by encoder and decoder buffers and the negotiated parameters. 
If necessary, bits are filled in order to avoid underflow within the 
encoder buffer 120. 

The Leaky Bucket (LB) counter 150 is a device for policing 
traffic, which increases Rj bits and decrease R s bits during the i-th 
frame coding period. When the LB counter is full, predetermined data 
transferred from the encoder buffer are discarded. For example, if Rj = 
R s (or K p = 0), the LB counter size L max = 0, as the result, the channel 

10 
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rate is limited to R s . 

The video decoder 320 starts decoding i-th frame at t=(i+D-l)T, 
and completes the decoding at t=(i+D)T. Therefore, all data of the i-th 
frame should be transmitted to the decoder buffer 310 by t=(i+D)T. 

The constraints on encoder and channel rate are described in 
detail in the above-identified references by Reibman and Haskell. A 
preferred range of encoder and channel rates imposed by traffic 
parameters of buffer sizes of the encoder and decoder is as follows. 

The video decoder 320 waits during D frame periods to start 
decoding after data begins to be received. The encoder rate E; should 
be transmitted before t=(i+D)T. When the channel rate is Rj, the buffer 
states are described as equations (1) and (2). 

b;=bi 1 + e,-r i , o<5;<Cx (0 

Bf = BU + R, - E t . Dt o (2) 
where B*=B?=0 and E,=R,=0 for i<0. 

The traffic policer comprises an LB counter 150 and detects a 
state of the virtual buffer inside the network 200. The LB counter 
receives Rj bits and outputs R s during the i-th frame period. When the 
LB counter size is L max , the LB counter state at the end of the i-th frame 
period t=iT is 

L, = min{Z, max , max{0, + R, - R, }} (3 ) 

where i=l,2,... and Lo=0. 

The system cannot store more than 5* ax + B J max bits at any given 
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time. If the number of bits coded from successive D frames is larger 
than the total size of system buffers B e mm + B^ , some coded data may 
be lost by the buffer overflow. Therefore, the following relationship is 
valid for iDD. 

fx,^,;^! (4) 

7=0 

The above relationship is readily obtained from Equations (1) and 
(2). It is noted that the encoder rate (or the target quantization 
parameter) and the delay are closely related to the encoder buffer 120 
and decoder buffer 310 sizes. 

If the channel rate is selected within the maximum rate allowed 
by the decoder buffer 310 and LB policer, the delay constraints can be 
guaranteed. From equations (2) and (3), the maximum number of bits 
Ri + capable of being transmitted during the i-th frame period can be 
represented as follow. 

R; =min{5l -BU+E^L^ -L,. x + R,} (5) 
As mentioned above, the encoder rate E t preferably satisfies the 
conditions of the decoder buffer overflow at decoding time as well as 
the encoder buffer overflow at encoding time. Therefore, the 
maximum number of bits E* capable of being generated by video 
encoder can be obtained by equations (1) and (2) as follows. 
E; = minfiC, - BU + R! , 5,1 0 -i + i£ 0 } (6) 
where Bf +D _ x and R. +D can be calculated by using Ej, j=i, i+1, 
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i+D-1 and equation (5), as described in the above-identified reference 
by Reibman. Additionally, the actual encoder rate Ej and the actual 
channel rate Rj preferably satisfy the encoder underflow condition, 
B°_ x < However, when the encoder buffer underflow is expected, 

it can be avoided by stuffing bits appropriately. 

If the number of bits coded from successive D frames are larger 
than the system buffer size, B L max +£^ x , some frames following them 
should be coded with larger quantization parameters than the set of 
target quantization parameter Q tgt so as to avoid loss of coded data. 
This causes degradation of visual quality of transmitted video, and as a 
result, an unintentional fluctuation of the quality of video may occur. 
The fluctuation can be avoid by the right selection of Q tgt considering 
the system buffer size. 

It is assumed that D is less than or equal to the GOP (group of 
picture) size. When one of D frames is an I-frame, the largest buffer is 
preferred. The D frames comprise one I-frame, N P P-frames, and N B 
B-frames. If the number of bits of the I-frame, P-frame, and B-frame 
are ratios of x:y:z, respectively, all frames will have a similar visual 
quality, and the number of bits capable of being allocated to I-frame is 
given as follows. 




(7) 



It is noted that x, y, and z are empirically selected, such as 
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x:y:z=4:2:l. In the reference "An Accurate Control of MPEG Video By 
Rate-Codewords Modeling," by T. Y. Kim and J. K. Kim, Proc. 
ISCAS'97, pp. 1261-1264, 1997, a bit rate estimation model 
(transmission rate) and an actual bit rate control algorithm (actual 
transmission rate control algorithm) are based on the linear relationship 
between codeword count and real bit count, and the target quantization 
parameter for I-frame is given as follow. 

Q;' m r\cXQ,)-C,(E,)\ (8) 

where Cj(Qi) means the number of codewords generated by using 
the I-frame quantization parameter Q[ calculated from the pre-analysis 
of discrete cosine transform (DCT) coefficients, and C,.(£,) is the 
number of codewords estimated from the target bit count Ej given by 
equation (7). That is, 

C,{E,) = ^- (9) 
a 

where a and (5 are model parameters calculated from the coded 
results of previous frame with the same type. The P-frame and B- 
frame quantization parameters Q P and Q B are determined, so that the 
peak signal to noise ratio (PSNR) of P-frame and B-frame may be 
equivalent to I-frame. According to the present invention, {Q h Q P , 
Q B } = {3 5 4, 5} for QI < 3, {Q,, Q P , Q B } = {Qi, Qi+U Qi+2} for 4 < Q, < 9, 
and {Q,, Q P , Q B }={Qi, Qi+2, Q,+4} for 10 < Q,. 
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It is noted that the set of target quantization parameter Q tg { should 
be selected before each frame is coded. Indeed, whenever there are 
scene changes, modification of the quantization parameters using the 
above procedure would be very efficient for keeping consistent visual 
5 quality. Furthermore, even when there is no scene change, if many 
frames are coded by quantization parameters different from the Q tgt , the 
Q tgt should be modified. 

It is assumed that E^Q^) and E t {Q tgt ) are the actual and 
estimated number of bits quantized with Q tgt fro the i-th frame, 
10 respectively. In case of test model 5 (TM5), ) = A7 0 lgf , where X 

is global complexity of the previous frame with the same type. In the 



in ~ 

f|| above reference by Kim et al., the number of bits EXQ t& ) is estimated 

s exactly by using the linear relationship between the number of 

is; 

codewords and the estimated number of bits as follows. 

ru 

gj 15 E i (Q lgl ) = <*C i (Q lgl ) + /3 (10) 



where a and (5 are same as the parameters of equation (9). In 
case of ideal estimation, EXQ m ) is equal to E^Q^). The method by 
the linear relationship provides much more excellent estimation 
performances than the TM5 proposed by the paper of Kim et al. In a 
20 preferred embodiment of the present invention, the bit rate estimation 
model disclosed by Kim et al is extended for the encoder rate control 
method. 

If the i-th frame is coded with the Q tgt , the number of bits £,(2 W ) 
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is generated and the encoder is controlled so as to be E i <E* . As a 
result, if E^Q^^E*, a target bit rate Ef becomes E^Q^), and if 
Ei(Qi 8 r) > E * > E f becomes . Then the encoder is controlled so that 
E i <Ef . However, because of the inaccuracy of encoder rate control 
5 generated from inaccurate bit rate estimation, the target bit rate Ef is 
preferably estimated as follows. 

If £,(G w )<a£, + , Ef =1^) and quantize with Q tgt (11) 
and if E^Q^^cxE* , Ef =aE* and control the encoder rate 
where the constant a is selected so that the actual bit rate does 

n 

$ 10 not exceed the bit rate bound, i.e. < E. , and in the present invention, 

m 

j u it is preferred that 0.9 is used for the constant a. It is preferred that, 

for VBR MPEG video, all frames are coded with the target parameter 
Q t g t to maintain consistent visual quality from frame to frame. 
However, because of several constraints imposed by traffic parameters 
15 and buffer sizes, the actually generated bit counts Ej may be different 
from the real bit counts Ei(Q tg t) coded by using the Q tgt . If the encoder 
rate is controlled so that E i <E* , then overflows within the encoder 
buffer can not occur. Underflows that may be generated at £, « £ ; + can 

be solved by stuffing bits. 
20 However, if £, > E- 9 some data may be lost by overflows in the 

encoder buffer or underflows in the decoder buffer. This is preferably 
solved by appropriately selecting quantization parameters of each slice 
in a frame as follows. 
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If q i)k is the actually applied quantization parameter to k-th slice 
in the i-th frame, k=0, 1, N S | ice -l and the N s i ice means the number of 
slice in a frame. The q i)k comprises q ik and Aq ik , where q ik is an 
initial parameter determined by Ef and Aq ik is an additional 
5 parameter selected by considering the bit counts generated until (k-l)th 
slice. 

First, it is required to determine the value of q ik for slice k. If 
Ef = E t {Q tgt ), q ik sets the target quantization parameter Q tgt , and if 
Ef =aEf , q ik is properly selected so as to meet the target bit rate Ef 
10 by using the method proposed in the paper of Kim et al. The above 
two cases are derived from equation (11). 

After determining the value of q ik for slice k, Aq ik is 
determined. The Aq ik is selected by considering the actually 

generated bit counts and the upper bound of bit counts to (k-1) slice in 
the i-th frame. If E*l is the maximum number of bits for slice k in the 
i-th, Aq ik can be obtained as follows. 
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Aq ik =min { 3 1 , Aq ik _ { + 1 } , i f £ E u > £ £+, 

\j=0 j=0 J 



(12) 



A^=max{0, Aq^ -1}, if [l^XX^ 



where Aq i0 = 0, Eg is the actually generated number of bits from slice j 
20 in the i-th frame. Finally, the actual quantization parameter q it k is 
given by 



17 



8836-135 (IB10153-US) 



q ik = max {1, min {3 1, q ik + Aq ik } } (13) 
where the value of q it k is preferably limited to between 1 and 31 
because q lt k i§ preferably used as a quantization step size in TM5. 

The number of bits capable of being transmitted will be limited to 
/?, < B;_ { + E { from the condition of encoder underflows. Because the 
actual encoder rate Ej cannot be known at the starting time of encoding 
of the i-th frame, the channel rate is redefined by the target number of 
bits Ef of equation (11) as follows. 

% r =min{B' l +£*',2?1 - Bf_ x + -I M + *,} (14) 

The encoder rate is controlled so that £, = . As a result, the 
encoder buffer underflows that may occur as R i - 5*., + Ef can be 
solved by stuffing bits. 

The channel rate R\ that is computed as indicated above is the 
maximum rate allowed by the decoder buffer and LB counter. This 
method advantageously causes the encoder buffer 120 maintain large 
empty area as described in equation (6) of 2?* ax -5,1, . The empty area 
enlarges the upper bounds of encoder rate. If the encoder buffer 
becomes full (B^ is nearly equal to #* ax ), E* is limited to # ( + . 
Therefore, control of the encoder rate is less frequently applied by the 
larger upper bounds of encoder rate, resulting in more consistent quality 
video. 

Since asynchronous transfer mode (ATM) network provides a 
bandwidth constrained by a limit of available bandwidth or traffic 
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contracts with each connection, it is inevitable for the connection to 
transmit constrained VBR. For the control of the constrained VBR, 
ATM forum specifies parameters such as peak rate Rp, sustainable rate 
Rs 5 and maximum burst length MBL. 

The maximum bandwidth for transmitting data that is coded with 
Q tgt to guarantee the transmission of VBR MPEG video closely related 
to visual quality needs to be reserved within the network 200. It can 
be achieved by reserving the bandwidth as a peak rate of the video 
traffic. However, since a video sequence consists of each scene 
together with very different characteristics of generating bits, low 
network utilization is inevitable. 

The smoothing (or shaping) is one of the most promising methods 
to reduce the peak rate of the traffic, and efficiently improve network 
utilization. However, if the smoothing process does not take into 
consideration various probable characteristics of the traffic, the 
problem of low network utilization still remains. The bandwidth 
renegotiation is another important method for overcoming the problem. 
The present invention provides methods for smoothing and bandwidth 
renegotiation. 

Since frame types periodically appear based on a pre-defined 
group of pattern (GOP) of a MPEG video, traffic pattern generated from 
the MPEG video source is basically determined at the GOP level. Thus, 
to make a decision of renegotiation, it is reasonable to select an interval 
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for monitoring the traffic during a multiple of GOP size. Preferably, 
one GOP period comprising G N frames is utilized for monitoring the 
traffic. 

Eavg(Qtgt) and E max (Q tg t) denote the average number and maximum 
number of bits per frame of GOP period including the current frame i, 
respectively. The average number and maximum number of bits per 
frame after the i-th frame are preferably coded as follows. 

^xa^=7^zW>(a,,) (i5) 

£™ (0 W ) = K m x maxfc, {Q m ), (fi w ),..,£,_ Gv+1 )} 
where K m (>l) is a constant, and the purpose of the constant K m is 
to offer a margin for selecting the maximum bit rate £ max (0 w ). For 
example, if K m = 1 5 E max (Q tgt ) is same as the maximum bit rate during 
the GOP period, and if K m = 2, E mgK (Q t& ) becomes twice of the 
maximum bit rate of the GOP period. The generated number of bits 
Ej(Qtgt) O'-i-G^+l, i) in equation (15) is determined as Ej(Q tgt )=Ej 
(for i-G}f+l<j <i) if the j-th frame was coded with Q tgt , otherwise 
E AQ< st ) = Ej(Q t8t )- Ej(Q t8 ,) = Ej(Q« t ) is defined as an absolute value 
because the i-th frame was not coded. 

R s and R p denote the normal sustainable rate and peak rate for 
the passed GOP period including the current frame i, respectively. R s 
and R p are preferably given by: 

I = E avg (Q tgt ) + K s jE M (Q tgt ) - E avg (Q v )} (16) 



20 



836-135 (IB10153-US) 



R p =R s +K p {E max (Q lg ,)-Rsl 



(17) 



where K s and K p are constants satisfying the conditions of 0<K S <1 
and 0< K p < 1, respectively. Thus, R s and R p satisfy the conditions 
of E^iQ^^R^E^iQ^,) and R s .< R p < £ inas (& K ,), respectively. If 
K p =l, the traffic is not smoothed, i.e. R p = E imx {Q vl ) , and defined as 'no 
smoothed VBR'. While if K P <1, the maximum rate E^iQ^) is 
smoothed to R p (< E max (Q ml )) . In this case, smoothing buffers are 
preferably implemented. 

The object of bandwidth renegotiation is to dynamically allocate a 
specific bandwidth that varies based on scene content. A preferred 
index of traffic variation trend UPC var (=0, 1,2, ...) that is given by 
comparing the normal sustainable rate R s together with the current 
sustainable rate R s is as follows. 

If (R x < (\-b)RJ, decrement UPC var (18) 

else if (( 1 +b)R s < R s ), increment UPC var 

else if (0 < UPC var ), decrement UPC var 

else if (UPC var < 0), increment UPC var 

else UPC var = 0 

where 0<b<l and UPC var is initially set to zero. Whenever the 
bandwidth will be renegotiated, or not renegotiated, the bandwidth is 
preferably determined by comparing UPC var and a predefined threshold 
value X(>0). That is, 



if \UPCj>X, 
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bandwidth renegotiation by {R s ,R p ,MBL} = {R s ,R pi R p } (19) 
It is noted that the values of b and X determine a rate of 
bandwidth renegotiation. If the value of b is close to 1 or X is set to a 
large value, the bandwidth renegotiation hardly occurs. The time of 
bandwidth renegotiation T neg0 is the frame periods consuming between 
the beginning time and the end time of the bandwidth renegotiation. It 
is assumed that the bandwidth renegotiation is carried out from the next 
I-frame until before T neg0 . It means that the current traffic parameters 
are replaced by new traffic parameters at the time of the next I-frame. 
In simulations, if D<3 frames, T nege = 3T 5 otherwise T neg0 = DT. 

The LB counter size based on the negotiated traffic parameters is 
given by 



It means that the maximum burst length MBL in peak rate equals 
to the negotiated bit rate R p , i.e. MBL = R p when Lj=0. L max is an 
adequate size of LB counter in order that the LB counter is increased by 
R s = R p bits and decreased by R s bits during frame period i. 

When a scene change occurs from simple to complicated scene, 
the current traffic parameters may be affected even though a changed 
frame is quantized with a large quantization parameter. This is 
because the previous traffic parameters are used for the changed frame 
that has different characteristics from the previous scene. To solve the 
problem, a lower bound of sustainable rate R s is preferably defined. 




(20) 
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VBR traffic may be transmitted without delay ('no delay VBR'), 
or delayed in some parts by a smoothing mechanism ('smoothed VBR'). 

First, in the case of no delay VBR', ideal transmission of VBR 
traffic requires no delay, i.e. no buffer, and the 'no delay VBR' can be 
achieved when a limitless bandwidth is provided to the traffic. The 
limitless bandwidth can guarantee the transmission of the traffic at its 
peak rate. An allocation of the peak rate leads to inefficient utilization 
of network resources. The inefficiency can be improved by adapting a 
bandwidth renegotiation method according to the invention. 

The conditions for 'no delay VBR' transmission comprise D=0 
an d B e max + B^ ax =0. The conditions of traffic parameters for the 
bandwidth renegotiation are preferably as follows. 

To transmit the coded data Ei(Q tgt ) with no delay, the upper bound 
of transmission rate is preferably R+ = min{£ ( .(g w ),L max + R s } from 
equation (5). Therefore, if E^) < I max -Z M R;=E t (Q^) for all 

i. It is noted that E, {Q tgt ) < £ nm {Q m ) and R s < L m ^ - R s < Rp . If 
E^{Q tgt )<L^-L t _,+R s =R p for £,(0 W ) = and if the average 

of A™ -4-1 + R s> G^-Gn+i, i) is greater than E a ^(Q li9 ) for any GOP, 
all data can be transmitted without delay. From equations (15) and 
(17), a proper condition for 'no delay VBR' are K m > 1, K s > 0, and K p 
= 1 with the requirements D=0 and B c mAX = 0. If the negotiated 
traffic parameters are R s , R p , MBL, then the size of LB counter is R p -R s 
from equation (20) and the upper bound of transmission rate is 
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r; = mm{E„L max - I,., + R s } from equation (5). As a result, the encoder 
rate is preferably controlled so that £, < /?, + . 

In the case of 'smoothed VBR', smoothing is a preferred method 
to protect traffic flows from being lost as well as to improve network 
5 utilization. The present invention provides a traffic smoothing method 
that is capable of reducing a peak rate to a sustainable rate. It is noted 
that the traffic smoothing requires D > 0 and B L max + B'^ x >0. 

From equations (16) and (17), there are two specific cases that the 
transmission rate Ri is always less than or equal to the peak rate R p , and 
10 the average of the R ; is less than or equal to the sustainable rate R s . 

When K p = 1, the peak rate R p is greater than or equal to the sustainable 
rate R s . When K p = 0, the upper bound of transmission rate Ri can be 
determined by R p = R s . In other words, the transmission rate Rj is 
preferably smoothed to the sustainable rate R s where D > 0. The 
15 'smoothed VBR' is defined as K p = 0 and D > 0. IfK p <l,the 

maximum bit rate £ max (£>„„ ) is smoothed by equation (17). In the case 
of 'smoothed VBR', since L max = 0 from equation (20), the upper bound 
of channel rate of equation (5), i.e. the maximum number of bits 
capable of being transmitted during i-th frame period R* is given by 
20 R! =mm{B d m3X -Bi l+ E i . D ,R s } (21) 

To transmit the coded data £,(0, x ,) with Q lgt at /?,(< fl v ), the 
condition £,(0, g ,) = £, + is preferably satisfied. When #;',=(), the 
maximum number of bits R. is limited to the sustainable rate R s from 
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equation (21). When R+ =R S , (j=l, i+1, i+D), the largest data can 
be transferred by channel. In this case, the condition 

E i(Qt & ) = E ! = b ?+d-\ + ^.v f rom equation (6) is given by 



Equations (4) and (22) are conditions selected so that all E ( (Q (gt ) 
can be transmitted at the smoothed rate R i (= R s ) . Since the smaller D 
makes R s be more closed to E max (Q tgl ), the constant K s should be closed 

to 1. When the delay D is assumed to be equal to the total length of 
sequence, all frames are coded with Q tgt and then transmitted to 
constant bit rate E nv (Q tal ) . 

avg ^z^tgt * 

For example, a MPEG sequence is assumed to be GOP structure 
of IBPBPBPBPBPB. In case that the delay is D frame, among several 
successive (D+l) frames satisfying the conditions of equations (4) and 
(22), when I-frame is included in the (D+l) frames, the maximum 
number of bits is generated. It is assumed that the GOP structure is 6 1, 
2, 3\ When D = 3 and the data ratio of I-frame, P-frame, and B-frame 
for consistent visual quality is x:y:z, the smoothed sustainable rate R s = 
(x+y+2z)U/4, where U is constant for reflecting the ratio. If x:y:z = 



4:2:1, E max (Q lgt )(= *) : E avg (Q lgt ) :R S .= 4 : 5/3 : 2 . In this case, the smoothed 
rate or sustainable rate R s is half of E m ^(Q lgl ) and 20 % of E avg (Q lgt ) . 
If D = 5, R s is 10% larger than E avg (Q tgt ) . 

According to the present invention, a variable bit rate MPEG 



D 



Y,E H (Q lKl )<R s (D + l) 



(22) 
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video with consistent quality can be transmitted in real time. 

Although illustrative embodiments have been described herein 
with reference to the accompanying drawings, it is to be understood that 
the present invention is not limited to those precise embodiments, and 
that various other changes and modifications-may be affected therein by 
one skilled in the art without departing from the scope and spirit of the 
invention. It is to be understood that all such changes and modifications 
are intended to be included within the scope of the invention as defined 
by the appended claims. 
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